Picture for Yanning Zhang

Yanning Zhang

No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves

Add code
May 05, 2025
Viaarxiv icon

Vision and Intention Boost Large Language Model in Long-Term Action Anticipation

Add code
May 03, 2025
Viaarxiv icon

Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views

Add code
Apr 29, 2025
Viaarxiv icon

FusionNet: Multi-model Linear Fusion Framework for Low-light Image Enhancement

Add code
Apr 27, 2025
Viaarxiv icon

SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model

Add code
Apr 14, 2025
Viaarxiv icon

AVadCLIP: Audio-Visual Collaboration for Robust Video Anomaly Detection

Add code
Apr 06, 2025
Viaarxiv icon

DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding

Add code
Mar 24, 2025
Viaarxiv icon

Boosting HDR Image Reconstruction via Semantic Knowledge Transfer

Add code
Mar 19, 2025
Viaarxiv icon

AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation

Add code
Mar 09, 2025
Viaarxiv icon

HyperGCT: A Dynamic Hyper-GNN-Learned Geometric Constraint for 3D Registration

Add code
Mar 04, 2025
Viaarxiv icon